SEQUENCE LISTING



(1) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 756
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Pro Ala Ser Thr Tyr Arg Leu Gln Ile Ser Ala Glu Phe Thr Leu Phe
1               5                   10                  15
Asp Ala Ala Arg Ile Val Pro Tyr Leu His Arg Leu Gly Ala Asp Trp
            20                  25                  30
Leu Tyr Leu Ser Pro Leu Leu Glu Ser Glu Ser Gly Ser Ser His Gly
        35                  40                  45
Tyr Asp Val Val Asp His Ser Arg Val Asp Ala Ala Arg Gly Gly Pro
    50                  55                  60
Glu Gly Leu Ala Glu Leu Ser Arg Ala Ala His Glu Arg Gly Met Gly
65                  70                  75                  80
Val Val Val Asp Ile Val Pro Asn His Val Gly Val Ala Thr Pro Lys
                85                  90                  95
Ala Asn Arg Trp Trp Trp Asp Val Leu Ala Arg Gly Gln Arg Ser Glu
            100                 105                 110
Tyr Ala Asp Tyr Phe Asp Ile Asp Trp Glu Phe Gly Gly Gly Arg Leu
        115                 120                 125
Arg Leu Pro Val Leu Gly Asp Gly Pro Asp Glu Leu Asp Ala Leu Arg
    130                 135                 140
Val Asp Gly Asp Glu Leu Val Tyr Tyr Glu His Arg Phe Pro Ile Ala
145                 150                 155                 160
Glu Gly Thr Gly Gly Gly Thr Pro Arg Glu Val His Asp Arg Gln His
                165                 170                 175
Tyr Glu Leu Met Ser Trp Arg Arg Ala Asp His Asp Leu Asn Tyr Arg
            180                 185                 190
Arg Phe Phe Ala Val Asn Thr Leu Ala Ala Val Arg Val Glu Asp Pro
        195                 200                 205
Arg Val Phe Asp Asp Thr His Arg Glu Ile Gly Arg Trp Ile Ala Glu
    210                 215                 220
Gly Leu Val Asp Gly Leu Arg Val Asp His Pro Asp Gly Leu Arg Ala
225                 230                 235                 240
Pro Gly Asp Tyr Leu Arg Arg Leu Ala Glu Leu Ala Gln Gly Arg Pro
                245                 250                 255
Ile Trp Val Glu Lys Ile Ile Glu Gly Asp Glu Arg Met Pro Pro Gln
            260                 265                 270
Trp Pro Ile Ala Gly Thr Thr Gly Tyr Asp Ala Leu Ala Gly Ile Asp
        275                 280                 285
Arg Val Leu Val Asp Pro Ala Gly Glu His Pro Leu Thr Gln Ile Val
    290                 295                 300
Asp Glu Ala Ala Gly Ser Pro Arg Arg Trp Ala Glu Leu Val Pro Glu
305                 310                 315                 320
Arg Lys Arg Ala Val Ala Arg Gly Ile Leu Asn Ser Glu Ile Arg Arg
                325                 330                 335
Val Ala Arg Glu Leu Gly Glu Val Ala Gly Asp Val Glu Asp Ala Leu
            340                 345                 350

Val Glu Ile Ala Ala Ala Leu Ser Val Tyr Arg Ser Tyr Leu Pro Phe
        355                 360                 365
Gly Arg Glu His Leu Asp Glu Ala Val Ala Ala Ala Gln Ala Ala Ala
    370                 375                 380
Pro Gln Leu Glu Ala Asp Leu Ala Ala Val Gly Ala Ala Leu Ala Asp
385                 390                 395                 400
Pro Gly Asn Pro Ala Ala Leu Arg Phe Gln Gln Thr Ser Gly Met Ile
                405                 410                 415
Met Ala Lys Gly Val Glu Asp Asn Ala Phe Tyr Arg Tyr Pro Arg Leu
            420                 425                 430
Thr Ser Leu Thr Glu Val Gly Gly Asp Pro Ser Leu Phe Ala Ile Asp
        435                 440                 445
Ala Ala Ala Phe His Ala Ala Gln Arg Asp Arg Ala Ala Arg Leu Pro
    450                 455                 460
Glu Ser Met Thr Thr Leu Thr Thr His Asp Thr Lys Arg Ser Glu Asp
465                 470                 475                 480
Thr Arg Ala Arg Ile Thr Ala Leu Ala Glu Ala Pro Glu Arg Trp Arg
                485                 490                 495
Arg Phe Leu Thr Glu Val Gly Gly Leu Ile Gly Thr Gly Asp Arg Val
            500                 505                 510
Leu Glu Asn Leu Ile Trp Gln Ala Ile Val Gly Ala Trp Pro Ala Ser
        515                 520                 525
Arg Glu Arg Leu Glu Ala Tyr Ala Leu Lys Ala Ala Arg Glu Ala Gly
    530                 535                 540
Glu Ser Thr Asp Trp Ile Asp Gly Asp Pro Ala Phe Glu Glu Arg Leu
545                 550                 555                 560
Thr Arg Leu Val Thr Val Ala Val Glu Glu Pro Leu Val His Glu Leu
                565                 570                 575
Leu Glu Arg Leu Val Asp Glu Leu Thr Ala Ala Gly Tyr Ser Asn Gly
            580                 585                 590
Leu Ala Ala Lys Leu Leu Gln Leu Leu Ala Pro Gly Thr Pro Asp Val
        595                 600                 605
Tyr Gln Gly Thr Glu Arg Trp Asp Arg Ser Leu Val Asp Pro Asp Asn
    610                 615                 620
Arg Arg Pro Val Asp Phe Ala Ala Ala Ser Glu Leu Leu Asp Arg Leu
625                 630                 635                 640
Asp Gly Gly Trp Arg Pro Pro Val Asp Glu Thr Gly Ala Val Lys Thr
                645                 650                 655
Leu Val Val Ser Arg Ala Leu Arg Leu Arg Arg Asp Arg Pro Glu Leu
            660                 665                 670
Phe Thr Ala Tyr His Pro Val Thr Ala Arg Gly Ala Gln Ala Glu His
        675                 680                 685
Leu Ile Gly Phe Asp Arg Gly Gly Ala Ile Ala Leu Ala Thr Arg Leu
    690                 695                 700
Pro Leu Gly Leu Ala Ala Ala Gly Gly Trp Gly Asp Thr Val Val Asp
705                 710                 715                 720
Val Gly Glu Arg Ser Leu Arg Asp Glu Leu Thr Gly Arg Glu Ala Arg
                725                 730                 735
Gly Ala Ala Arg Val Ala Glu Leu Phe Ala Asp Tyr Pro Val Ala Leu
            740                 745                 750
Leu Val Glu Thr
        755

(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Asp Ile Val Pro Asn His
1               5

(3) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Gly Thr Thr Gly Tyr Asp
1               5

(4) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Pro Ala Ser Thr Tyr Arg Leu Gln Ile Ser Ala Glu Phe Thr Leu Phe
1               5                   10                  15
Asp Ala Ala Arg
            20

(5) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ser Leu Val Asp Pro Asp Asn Arg Arg Pro Val Asp Phe Ala Ala Ala
1               5                   10                  15
Ser Glu Leu Leu
            20

(6) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Ala Asn Arg Trp Trp Trp Asp Val Leu Ala Arg Gly Gln Arg Ser Glu
1               5                   10                  15
Tyr Ala Asp Tyr
            20

(7) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2268
(B) TYPE: nucleic acid
(C) strandedness: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CCCGCCAGTA CCTACCGCCT TCAGATCTCG GCGGAGTTCA CCCTCTTCGA CGCGGCGCGC 60
ATCGTGCCCT ACCTGCACCG CCTCGGCGCC GACTGGCTGT ACCTCTCGCC GCTGCTCGAG 120
TCCGAGTCGG GCTCCTCGCA CGGCTACGAC GTGGTCGACC ACTCCCGCGT CGACGCCGCC 180
CGCGGCGGGC CGGAGGGGCT CGCCGAGCTC TCCCGTGCGG CGCACGAGCG CGGCATGGGC 240
GTCGTCGTCG ACATCGTGCC CAACCACGTC GGCGTCGCGA CGCCGAAGGC GAACCGCTGG 300
TGGTGGGACG TTCTGGCCCG TGGACAGCGG TCGGAGTACG CCGACTACTT CGACATCGAC 360
TGGGAGTTCG GCGGCGGCAG GCTGCGCCTG CCCGTGCTCG GCGACGGCCC CGACGAGCTC 420
GACGCGCTGA GAGTGGATGG CGACGAGCTC GTCTACTACG AGCACCGCTT CCCGATCGCC 480
GAGGGCACCG GCGGCGGCAC CCCGCGCGAG GTGCACGACC GGCAGCACTA CGAGCTGATG 540
TCGTGGCGGC GGGCCGACCA CGACCTCAAC TACCGCCGCT TCTTCGCCGT GAACACGCTC 600
GCCGCCGTAC GCGTCGAAGA CCCGCGCGTG TTCGACGACA CCCACCGCGA GATCGGCCGC 660
TGGATCGCCG AGGGCCTCGT CGACGGCCTG CGCGTCGACC ACCCCGACGG GCTGCGCGCC 720
CCCGGCGACT ACCTGCGCCG TCTCGCCGAG CTCGCCCAAG GCAGGCCGAT CTGGGTCGAG 780
AAGATCATCG AGGGCGACGA GCGGATGCCC CCGCAGTGGC CCATCGCCGG CACCACCGGC 840
TACGACGCGC TGGCCGGGAT CGACCGGGTG CTCGTCGACC CCGCGGGCGA GCATCCGCTC 900
ACCCAGATCG TCGACGAGGC GGCAGGCAGC CCCCGGCGCT GGGCCGAGCT GGTTCCCGAG 960
CGCAAGCGGG CCGTCGCCCG CGGCATCCTG AACTCCGAGA TCCGCCGCGT CGCCCGCGAA 1020
CTCGGAGAGG TCGCCGGCGA CGTCGAAGAC GCGCTCGTCG AGATCGCCGC CGCCCTGTCC 1080
GTCTACCGCA GCTACCTGCC GTTCGGGCGC GAGCACCTCG ACGAAGCCGT GGCCGCCGCG 1140
CAGGCCGCAG CCCCCCAGCT CGAGGCCGAC CTCGCCGCCG TCGGCGCAGC GCTCGCCGAC 1200
CCGGGCAACC CCGCCGCGCT CCGCTTCCAG CAGACCAGCG GCATGATCAT GGCCAAGGGC 1260
GTCGAGGACA ACGCGTTCTA CCGCTACCCC CGGCTCACCT CGCTGACCGA GGTCGGGGGA 1320
GACCCGAGCC TGTTCGCGAT CGACGCGGCC GCCTTCCACG CGGCGCAGCG CGACCGCGCC 1380
GCCCGGCTGC CCGAGTCGAT GACGACGCTG ACCACCCACG ACACCAAGCG CAGCGAAGAC 1440
ACCCGGGCGC GGATCACCGC GCTCGCCGAG GCCCCCGAAC GCTGGCGGCG CTTCCTGACC 1500
GAGGTCGGCG GGCTCATCGG AACGGGCGAC CGGGTGCTGG AGAACCTGAT CTGGCAGGCG 1560
ATCGTCGGCG CGTGGCCGGC GAGCCGGGAG CGGCTCGAGG CCTACGCGCT GAAGGCCGCG 1620
CGCGAAGCCG GCGAGTCGAC CGACTGGATC GACGGCGACC CCGCGTTCGA AGAGCGGCTG 1680
ACCCGCCTGG TCACGGTCGC CGTCGAGGAG CCGCTCGTGC ACGAGCTGCT CGAGCGGCTC 1740
GTCGACGAGC TGACGGCGGC CGGGTACTCC AACGGCCTCG CGGCGAAGCT GCTGCAGCTG 1800
CTCGCCCCCG GAACCCCCGA CGTGTACCAG GGCACGGAAC GCTGGGACCG GTCGCTGGTG 1860
GACCCGGACA ACCGTCGCCC CGTGGATTTC GCCGCGGCAT CCGAGCTGCT CGACCGCCTC 1920
GACGGCGGCT GGCGGCCGCC CGTCGACGAG ACCGGCGCGG TCAAGACGCT CGTCGTCTCC 1980
CGCGCGCTGC GGCTGCGCCG CGACCGGCCC GAGCTGTTCA CCGCGTACCA CCCGGTCACG 2040
GCGCGCGGCG CGCAGGCCGA GCACCTGATC GGCTTCGACC GCGGCGGCGC GATCGCCCTG 2100
GCCACCCGCC TGCCGCTCGG CCTCGCCGCC GCAGGCGGCT GGGGCGACAC GGTCGTCGAC 2160
GTCGGCGAGC GGAGCCTGCG CGACGAGCTG ACCGGCCGCG AGGCCCGCGG AGCGGCGCGC 2220
GTGGCCGAGT TGTTCGCCGA CTACCCCGTC GCCCTGCTGG TGGAGACA 2268

(8) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28
(B) TYPE: nucleic acid
(C) strandedness: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

TTTTTTAATA AAATCAGGAG GAAAAAAT 28

(9) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 575
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Asn Arg Arg Phe Pro Val Trp Ala Pro Gln Ala Ala Gln Val Thr
1               5                   10                  15
Leu Val Val Gly Gln Gly Arg Ala Glu Leu Pro Leu Thr Arg Asp Glu
            20                  25                  30
Asn Gly Trp Trp Ala Leu Gln Gln Pro Trp Asp Gly Gly Pro Asp Leu
        35                  40                  45
Val Asp Tyr Gly Tyr Leu Val Asp Gly Lys Gly Pro Phe Ala Asp Pro
    50                  55                  60
Arg Ser Leu Arg Gln Pro Arg Gly Val His Glu Leu Gly Arg Glu Phe
65                  70                  75                  80
Asp Pro Ala Arg Tyr Ala Trp Gly Asp Asp Gly Trp Arg Gly Arg Asp
                85                  90                  95
Leu Thr Gly Ala Val Ile Tyr Glu Leu His Val Gly Thr Phe Thr Pro
            100                 105                 110
Glu Gly Thr Leu Asp Ser Ala Ile Arg Arg Leu Asp His Leu Val Arg
        115                 120                 125
Leu Gly Val Asp Ala Val Glu Leu Leu Pro Val Asn Ala Phe Asn Gly
    130                 135                 140
Thr His Gly Trp Gly Tyr Asp Gly Val Leu Trp Tyr Ala Val His Glu
145                 150                 155                 160
Pro Tyr Gly Gly Pro Glu Ala Tyr Gln Arg Phe Val Asp Ala Cys His
                165                 170                 175
Ala Arg Gly Leu Ala Val Val Gln Asp Val Val Tyr Asn His Leu Gly
            180                 185                 190
Pro Ser Gly Asn His Leu Pro Asp Phe Gly Pro Tyr Leu Gly Ser Gly
        195                 200                 205
Ala Ala Asn Thr Trp Gly Asp Ala Leu Asn Leu Asp Gly Pro Leu Ser
    210                 215                 220
Asp Glu Val Arg Arg Tyr Ile Ile Asp Asn Ala Val Tyr Trp Leu Arg
225                 230                 235                 240
Asp Met His Ala Asp Gly Leu Arg Leu Asp Ala Val His Ala Leu Arg
                245                 250                 255
Asp Ala Arg Ala Leu His Leu Leu Glu Glu Leu Ala Ala Arg Val Asp
            260                 265                 270
Glu Leu Ala Gly Glu Leu Gly Arg Pro Leu Thr Leu Ile Ala Glu Ser
        275                 280                 285
Asp Leu Asn Asp Pro Lys Leu Ile Arg Ser Arg Ala Ala His Gly Tyr
    290                 295                 300
Gly Leu Asp Ala Gln Trp Asp Asp Asp Val His His Ala Val His Ala
305                 310                 315                 320
Asn Val Thr Gly Glu Thr Val Gly Tyr Tyr Ala Asp Phe Gly Gly Leu
                325                 330                 335
Gly Ala Leu Val Lys Val Phe Gln Arg Gly Trp Phe His Asp Gly Thr
            340                 345                 350
Trp Ser Ser Phe Arg Glu Arg His His Gly Arg Pro Leu Asp Pro Asp
        355                 360                 365
Ile Pro Phe Arg Arg Leu Val Ala Phe Ala Gln Asp His Asp Gln Val
    370                 375                 380
Gly Asn Arg Ala Val Gly Asp Arg Met Ser Ala Gln Val Gly Glu Gly
385                 390                 395                 400
Ser Leu Ala Ala Ala Ala Ala Leu Val Leu Leu Gly Pro Phe Thr Pro
                405                 410                 415
Met Leu Phe Met Gly Glu Glu Trp Gly Ala Arg Thr Pro Trp Gln Phe
            420                 425                 430
Phe Thr Ser His Pro Glu Pro Glu Leu Gly Glu Ala Thr Ala Arg Gly
        435                 440                 445
Arg Ile Ala Glu Phe Ala Arg Met Gly Trp Asp Pro Ala Val Val Pro
    450                 455                 460
Asp Pro Gln Asp Pro Ala Thr Phe Ala Arg Ser His Leu Asp Trp Ser
465                 470                 475                 480
Glu Pro Glu Arg Glu Pro His Ala Gly Leu Leu Ala Phe Tyr Thr Asp
                485                 490                 495
Leu Ile Ala Leu Arg Arg Glu Leu Pro Val Asp Ala Pro Ala Arg Glu
            500                 505                 510
Val Asp Ala Asp Glu Ala Arg Gly Val Phe Ala Phe Ser Arg Gly Pro
        515                 520                 525
Leu Arg Val Thr Val Ala Leu Arg Pro Gly Pro Val Gly Val Pro Glu
    530                 535                 540
His Gly Gly Leu Val Leu Ala Tyr Gly Glu Val Arg Ala Gly Ala Ala
545                 550                 555                 560
Gly Leu His Leu Asp Gly Pro Gly Ala Ala Ile Val Arg Leu Glu
                565                 570                 575





(10) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Trp Gly Tyr Asp Gly Val
1               5

(11) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Asp Val Val Tyr Asn His
1               5

(12) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Arg Leu Asp Ala Val His Ala
1               5

(13) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ile Ala Glu Ser Asp Leu Asn
1               5

(14) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Asn Arg Arg Phe Pro Val Trp Ala Pro Gln Ala Ala Gln Val Thr
1               5                   10                  15
Leu Val Val Gly
            20

(15) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Ser Arg Ala Ala His Gly Tyr Gly Leu Asp Ala Gln Trp Asp Asp Asp
1               5                   10                  15
Val His His Ala
            20

(16) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal fragment
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Asp Glu Asn Gly Trp Trp Ala Leu Gln Gln Pro Trp Asp Gly Gly Pro
1               5                   10                  15
Asp Leu Val Asp
            20



(17) INFORMATION FOR SEQ ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1725
(B) TYPE: nucleic acid
(C) strandedness: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

ATGAACCGAC GATTCCCGGT CTGGGCGCCC CAGGCCGCGC AGGTGACGCT CGTCGTGGGC 60
CAAGGCCGCG CCGAACTCCC GCTGACCCGC GACGAGAACG GATGGTGGGC TCTTCAGCAG 120
CCGTGGGACG GCGGCCCCGA CCTCGTCGAC TACGGCTACC TCGTCGACGG CAAGGGCCCC 180
TTCGCCGACC CGCGGTCGCT GCGGCAGCCG CGCGGCGTGC ACGAGCTCGG CCGCGAATTC 240
GACCCCGCCC GCTACGCGTG GGGCGACGAC GGATGGCGCG GCCGAGACCT CACCGGAGCC 300
GTGATCTACG AACTGCACGT CGGCACCTTC ACCCCTGAGG GAACGCTGGA CAGCGCCATC 360
CGTCGCCTCG ACCACCTGGT GCGCCTCGGC GTCGACGCGG TCGAGCTGCT GCCCGTCAAC 420
GCGTTCAACG GCACCCACGG CTGGGGCTAC GACGGGGTGC TCTGGTACGC GGTGCACGAG 480
CCCTACGGCG GCCCGGAGGC GTACCAGCGC TTCGTCGACG CCTGCCACGC CCGCGGCCTC 540
GCCGTCGTGC AGGACGTCGT CTACAACCAC CTGGGCCCGA GCGGCAACCA CCTGCCCGAC 600
TTCGGCCCCT ACCTCGGGTC GGGCGCCGCC AACACCTGGG GCGACGCGCT GAACCTCGAC 660
GGGCCGCTCT CCGACGAGGT GCGGCGGTAC ATCATCGACA ACGCGGTGTA CTGGCTGCGC 720
GACATGCACG CCGACGGGCT GCGGCTCGAC GCCGTGCACG CGCTGCGCGA CGCCCGCGCG 780
CTGCACCTGC TCGAAGAGCT CGCCGCCCGC GTCGACGAGC TGGCGGGCGA GCTCGGCCGG 840
CCGCTGACGC TCATCGCCGA GAGCGACCTG AACGACCCGA AGCTGATCCG CTCCCGCGCG 900
GCGCACGGCT ACGGCCTCGA CGCCCAGTGG GACGACGACG TGCACCACGC GGTGCACGCC 960
AACGTGACCG GCGAGACCGT CGGCTACTAC GCCGACTTCG GCGGGCTCGG CGCCCTCGTC 1020
AAGGTGTTCC AGCGCGGCTG GTTCCACGAC GGCACCTGGT CGAGCTTCCG CGAGCGGCAC 1080
CACGGCCGGC CGCTCGACCC CGACATCCCG TTCCGCCGGC TCGTCGCCTT CGCGCAGGAT 1140
CACGACCAGG TCGGCAACCG AGCGGTCGGC GACCGCATGT CGGCGCAGGT CGGCGAGGGT 1200
TCGCTCGCCG CCGCGGCGGC GCTCGTGCTG CTCGGCCCGT TCACCCCGAT GCTGTTCATG 1260
GGCGAGGAGT GGGGCGCGCG CACCCCGTGG CAGTTCTTCA CCTCCCACCC CGAGCCCGAG 1320
CTGGGGGAGG CGACGGCGCG CGGGCGCATC GCCGAGTTCG CCCGCATGGG CTGGGACCCG 1380
GCAGTCGTGC CCGACCCGCA GGACCCGGCC ACCTTCGCCC GCTCGCACCT GGACTGGTCC 1440
GAGCCCGAGC GGGAACCGCA CGCGGGCCTG CTCGCCTTCT ACACCGACCT GATCGCGCTG 1500
CGGCGCGAGC TGCCGGTCGA TGCGCCGGCG CGCGAGGTGG ATGCCGACGA GGCGCGCGGC 1560
GTCTTCGCGT TCAGCCGCGG CCCGCTGCGG GTCACGGTCG CGCTGCGCCC CGGACCGGTC 1620
GGGGTGCCCG AGCACGGGGG CCTCGTGCTC GCCTACGGCG AGGTGCGCGC CGGCGCCGCC 1680
GGACTGCACC TCGACGGGCC GGGAGCCGCG ATCGTGCGCC TCGAG 1725

(18) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

GCSAACCGST GGTGGTGGGA CGT 23
(19) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3252
(B) TYPE: nucleic acid
(C) strandedness: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Arthrobacter sp.
(B) INDIVIDUAL ISOLATE: S34 (FERM BP-6450)
(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..742
(C) IDENTIFICATION METHOD: E
(A) NAME/KEY: mat peptide
(B) LOCATION: 743..3013
(C) IDENTIFICATION METHOD: E
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

ATGCCGACGA CGAACTTGAG CGCGTTCTC^^ GGCACCCGCG AGAGCGGTCC GCGCACGGCG 60
GCGCCCAGTG CCACGACGAG CACGATCGCZG GCGAGCGCCG CGACGACGGC GACCGGCAGG 120
CGCCCCTGAT TGCTGGCGAA GOTGAGCA^G ATGAAGACCA CCTCGAGGCC CTCGAGCAAC 180
ACACCTTTGA ACGACACGGT GAACGCGTAC CAATCGGAGA CCCCGAACCG GCTCTCGCGC 240
CGGGCGCTCT CGGCCGCCTC GACCTGACGC CGGAAGGCAG CCTCCTCGTC ACGGAGAGCC 300
CTGCGCCCTG CCGCGCGCAG CACCGCCTTG CGCAGCCAGC CGAGCCCGAA GACGAGCAGC 360
AACCCGCCGA CGACGAGGCG CAGC^GGCC AGCGGCAGCA GCAGGATCGC GGGACCGACG 420
AGCGCGACGG CCGCGGCCAG CACCACCACG GCGACGGCGG CACCTGTCAG CGCCGACCGC 480
CAGCTGCGGG TGGCGCCGAC CGCGACGACG ATCGTGGTCG CCTCCACCGC CTCGACCACG 540
CAGGCGAGGA ACACGGCGGC GA^CAGGGCG ACGGCGGTCA TCGGCCCAGC AGACGGTTGA 600
CCATCACGGC ACGCTAGCGC CATTGCTCAC AGGAAGGGCC AAGACGCCCG CAACGCGGCA 660
CCCGTGGACG GCGCGTACCG GCGTGTGACC GATCGTGTCA ACCGGTGGCG CCCGCCCCGA 720
GCACCTGCGT AGATTCGGCC JC GTG CCC GCC AGT ACC TAC CGC CTT CAG ATC  772
                         Met Pro Ala Ser Thr Tyr Arg Leu Gln Ile
                         1               5                   10
TCG GCG GAG TTC ACC CTC TTC GAC GCG GCG CGC ATC GTG CCC TAC CTG  820
Ser Ala Glu Phe Thr Leu Phe Asp Ala Ala Arg Ile Val Pro Tyr Leu
                15                  20                  25
CAC CGC CTC GGC GCC GAC TGG CTG TAC CTC TCG CCG CTG CTC GAG TCC  868
His Arg Leu Gly Ala Asp Trp Leu Tyr Leu Ser Pro Leu Leu Glu Ser
            30                  35                  40
GAG TCG GGC TCC TCG CAC GGC TAC GAC GTG GTC GAC CAC TCC CGC GTC  916
Glu Ser Gly Ser Ser His Gly Tyr Asp Val Val Asp His Ser Arg Val
        45                  50                  55
GAC GCC GCC CGC GGC GGG CCG GAG GGG CTC GCC GAG CTC TCC CGT GCG  964
Asp Ala Ala Arg Gly Gly Pro Glu Gly Leu Ala Glu Leu Ser Arg Ala
    60                  65                  70
GCG CAC GAG CGC GGC ATG GGC GTC GTC GTC GAC ATC GTG CCC AAC CAC  1012
Ala His Glu Arg Gly Met Gly Val Val Val Asp Ile Val Pro Asn His
75                  80                  85                  90
GTC GGC GTC GCG ACG CCG AAG GCG AAC CGC TGG TGG TGG GAC GTT CTG  1060
Val Gly Val Ala Thr Pro Lys Ala Asn Arg Trp Trp Trp Asp Val Leu
                95                  100                 105
GCC CGT GGA CAG CGG TCG GAG TAC GCC GAC TAC TTC GAC ATC GAC TGG  1108
Ala Arg Gly Gln Arg Ser Glu Tyr Ala Asp Tyr Phe Asp Ile Asp Trp
            110                 115                 120
GAG TTC GGC GGC GGC AGG CTG CGC CTG CCC GTG CTC GGC GAC GGC CCC  1156
Glu Phe Gly Gly Gly Arg Leu Arg Leu Pro Val Leu Gly Asp Gly Pro
        125                 130                 135
GAC GAG CTC GAC GCG CTG AGA GTG GAT GGC GAC GAG CTC GTC TAC TAC  1204
Asp Glu Leu Asp Ala Leu Arg Val Asp Gly Asp Glu Leu Val Tyr Tyr
    140                 145                 150
GAG CAC CGC TTC CCG ATC GCC GAG GGC ACC GGC GGC GGC ACC CCG CGC  1252
Glu His Arg Phe Pro Ile Ala Glu Gly Thr Gly Gly Gly Thr Pro Arg
155                 160                 165                 170
GAG GTG CAC GAC CGG CAG CAC TAC GAG CTG ATG TCG TGG CGG CGG GCC  1300
Glu Val His Asp Arg Gln His Tyr Glu Leu Met Ser Trp Arg Arg Ala
                175                 180                 185
GAC CAC GAC CTC AAC TAC CGC CGC TTC TTC GCC GTG AAC ACG CTC GCC  1348
Asp His Asp Leu Asn Tyr Arg Arg Phe Phe Ala Val Asn Thr Leu Ala
            190                 195                 200
GCC GTA CGC GTC GAA GAC CCG CGC GTG TTC GAC GAC ACC CAC CGC GAG  1396
Ala Val Arg Val Glu Asp Pro Arg Val Phe Asp Asp Thr His Arg Glu
        205                 210                 215
ATC GGC CGC TGG ATC GCC GAG GGC CTC GTC GAC GGC CTG CGC GTC GAC  1444
Ile Gly Arg Trp Ile Ala Glu Gly Leu Val Asp Gly Leu Arg Val Asp
    220                 225                 230
CAC CCC GAC GGG CTG CGC GCC CCC GGC GAC TAC CTG CGC CGT CTC GCC  1492
His Pro Asp Gly Leu Arg Ala Pro Gly Asp Tyr Leu Arg Arg Leu Ala
235                 240                 245                 250
GAG CTC GCC CAA GGC AGG CCG ATC TGG GTC GAG AAG ATC ATC GAG GGC  1540
Glu Leu Ala Gln Gly Arg Pro Ile Trp Val Glu Lys Ile Ile Glu Gly
                255                 260                 265
GAC GAG CGG ATG CCC CCG CAG TGG CCC ATC GCC GGC ACC ACC GGC TAC  1588
Asp Glu Arg Met Pro Pro Gln Trp Pro Ile Ala Gly Thr Thr Gly Tyr
            270                 275                 280
GAC GCG CTG GCC GGG ATC GAC CGG GTG CTC GTC GAC CCC GCG GGC GAG  1636
Asp Ala Leu Ala Gly Ile Asp Arg Val Leu Val Asp Pro Ala Gly Glu
        285                 290                 295
CAT CCG CTC ACC CAG ATC GTC GAC GAG GCG GCA GGC AGC CCC CGG CGC  1684
His Pro Leu Thr Gln Ile Val Asp Glu Ala Ala Gly Ser Pro Arg Arg
    300                 305                 310
TGG GCC GAG CTG GTT CCC GAG CGC AAG CGG GCC GTC GCC CGC GGC ATC  1732
Trp Ala Glu Leu Val Pro Glu Arg Lys Arg Ala Val Ala Arg Gly Ile
315                 320                 325                 330
CTG AAC TCC GAG ATC CGC CGC GTC GCC CGC GAA CTC GGA GAG GTC GCC  1780
Leu Asn Ser Glu Ile Arg Arg Val Ala Arg Glu Leu Gly Glu Val Ala
                335                 340                 345
GGC GAC GTC GAA GAC GCG CTC GTC GAG ATC GCC GCC GCC CTG TCC GTC  1828
Gly Asp Val Glu Asp Ala Leu Val Glu Ile Ala Ala Ala Leu Ser Val
            350                 355                 360
TAC CGC AGC TAC CTG CCG TTC GGG CGC GAG CAC CTC GAC GAA GCC GTG  1876
Tyr Arg Ser Tyr Leu Pro Phe Gly Arg Glu His Leu Asp Glu Ala Val
        365                 370                 375
GCC GCC GCG CAG GCC GCA GCC CCC CAG CTC GAG GCC GAC CTC GCC GCC  1924
Ala Ala Ala Gln Ala Ala Ala Pro Gln Leu Glu Ala Asp Leu Ala Ala
    380                 385                 390
GTC GGC GCA GCG CTC GCC GAC CCG GGC AAC CCC GCC GCG CTC CGC TTC  1972
Val Gly Ala Ala Leu Ala Asp Pro Gly Asn Pro Ala Ala Leu Arg Phe
395                 400                 405                 410
CAG CAG ACC AGC GGC ATG ATC ATG GCC AAG GGC GTC GAG GAC AAC GCG  2020
Gln Gln Thr Ser Gly Met Ile Met Ala Lys Gly Val Glu Asp Asn Ala
                415                 420                 425
TTC TAC CGC TAC CCC CGG CTC ACC TCG CTG ACC GAG GTC GGG GGA GAC  2068
Phe Tyr Arg Tyr Pro Arg Leu Thr Ser Leu Thr Glu Val Gly Gly Asp
            430                 435                 440
CCG AGC CTG TTC GCG ATC GAC GCG GCC GCC TTC CAC GCG GCG CAG CGC  2116
Pro Ser Leu Phe Ala Ile Asp Ala Ala Ala Phe His Ala Ala Gln Arg
        445                 450                 455
GAC CGC GCC GCC CGG CTG CCC GAG TCG ATG ACG ACG CTG ACC ACC CAC  2164
Asp Arg Ala Ala Arg Leu Pro Glu Ser Met Thr Thr Leu Thr Thr His
    460                 465                 470
GAC ACC AAG CGC AGC GAA GAC ACC CGG GCG CGG ATC ACC GCG CTC GCC  2212
Asp Thr Lys Arg Ser Glu Asp Thr Arg Ala Arg Ile Thr Ala Leu Ala
475                 480                 485                 490
GAG GCC CCC GAA CGC TGG CGG CGC TTC CTG ACC GAG GTC GGC GGG CTC  2260
Glu Ala Pro Glu Arg Trp Arg Arg Phe Leu Thr Glu Val Gly Gly Leu
                495                 500                 505
ATC GGA ACG GGC GAC CGG GTG CTG GAG AAC CTG ATC TGG CAG GCG ATC  2308
Ile Gly Thr Gly Asp Arg Val Leu Glu Asn Leu Ile Trp Gln Ala Ile
            510                 515                 520
GTC GGC GCG TGG CCG GCG AGC CGG GAG CGG CTC GAG GCC TAC GCG CTG  2356
Val Gly Ala Trp Pro Ala Ser Arg Glu Arg Leu Glu Ala Tyr Ala Leu
        525                 530                 535
AAG GCC GCG CGC GAA GCC GGC GAG TCG ACC GAC TGG ATC GAC GGC GAC  2404
Lys Ala Ala Arg Glu Ala Gly Glu Ser Thr Asp Trp Ile Asp Gly Asp
    540                 545                 550
CCC GCG TTC GAA GAG CGG CTG ACC CGC CTG GTC ACG GTC GCC GTC GAG  2452
Pro Ala Phe Glu Glu Arg Leu Thr Arg Leu Val Thr Val Ala Val Glu
555                 560                 565                 570
GAG CCG CTC GTG CAC GAG CTG CTC GAG CGG CTC GTC GAC GAG CTG ACG  2500
Glu Pro Leu Val His Glu Leu Leu Glu Arg Leu Val Asp Glu Leu Thr
                575                 580                 585
GCG GCC GGG TAC TCC AAC GGC CTC GCG GCG AAG CTG CTG CAG CTG CTC  2548
Ala Ala Gly Tyr Ser Asn Gly Leu Ala Ala Lys Leu Leu Gln Leu Leu
            590                 595                 600
GCC CCC GGA ACC CCC GAC GTG TAC CAG GGC ACG GAA CGC TGG GAC CGG  2596
Ala Pro Gly Thr Pro Asp Val Tyr Gln Gly Thr Glu Arg Trp Asp Arg
        605                 610                 615
TCG CTG GTG GAC CCG GAC AAC CGT CGC CCC GTG GAT TTC GCC GCG GCA  2644
Ser Leu Val Asp Pro Asp Asn Arg Arg Pro Val Asp Phe Ala Ala Ala
    620                 625                 630
TCC GAG CTG CTC GAC CGC CTC GAC GGC GGC TGG CGG CCG CCC GTC GAC  2692
Ser Glu Leu Leu Asp Arg Leu Asp Gly Gly Trp Arg Pro Pro Val Asp
635                 640                 645                 650
GAG ACC GGC GCG GTC AAG ACG CTC GTC GTC TCC CGC GCG CTG CGG CTG  2740
Glu Thr Gly Ala Val Lys Thr Leu Val Val Ser Arg Ala Leu Arg Leu
                655                 660                 665
CGC CGC GAC CGG CCC GAG CTG TTC ACC GCG TAC CAC CCG GTC ACG GCG  2788
Arg Arg Asp Arg Pro Glu Leu Phe Thr Ala Tyr His Pro Val Thr Ala
            670                 675                 680
CGC GGC GCG CAG GCC GAG CAC CTG ATC GGC TTC GAC CGC GGC GGC GCG  2836
Arg Gly Ala Gln Ala Glu His Leu Ile Gly Phe Asp Arg Gly Gly Ala
        685                 690                 695
ATC GCC CTG GCC ACC CGC CTG CCG CTC GGC CTC GCC GCC GCA GGC GGC  2884
Ile Ala Leu Ala Thr Arg Leu Pro Leu Gly Leu Ala Ala Ala Gly Gly
    700                 705                 710
TGG GGC GAC ACG GTC GTC GAC GTC GGC GAG CGG AGC CTG CGC GAC GAG  2932
Trp Gly Asp Thr Val Val Asp Val Gly Glu Arg Ser Leu Arg Asp Glu
715                 720                 725                 730
CTG ACC GGC CGC GAG GCC CGC GGA GCG GCG CGC GTG GCC GAG TTG TTC  2980
Leu Thr Gly Arg Glu Ala Arg Gly Ala Ala Arg Val Ala Glu Leu Phe
                735                 740                 745
GCC GAC TAC CCC GTC GCC CTG CTG GTG GAG ACA TGAACCGACG ATTCCCGGTC  3033
Ala Asp Tyr Pro Val Ala Leu Leu Val Glu Thr
            750                 755
TGGGCGCCCC AGGCCGCGCA GGTGACGCTC GTCGTGGGCC AAGGCCGCGC CGAACTCCCG 3093
CTGACCCGCG ACGAGAACGG ATGGTGGGCT CTTCAGCAGC CGTGGGACGG CGGCCCCGAC 3153
CTCGTCGACT ACGGCTACCT CGTCGACGGC AAGGGCCCCT TCGCCGACCC GCGGTCGCTG 3213
CGGCAGCCGC GCGGCGTGCA CGAGCTCGGC CGCGAATTC 3252

(20) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

ATGCCCGCCA GTACCTACCG CCTTCA 26

(21) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

TCATGTCTCC ACCAGCAGGG CGACG 25

(22) INFORMATION FOR SEQ ID NO: 22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 50
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

AATTCTTTTT TAATAAAATC AGGAGGAATC TAGATGTTTA CTAGTCTGCA 50

(23) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

GACTAGTAAA CATCTAGATT CCTCCTGATT TTATTAAAAA AG 42

(24) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

AAATCTAGAT GCCCGCCAGT ACCTACCGCC TTC 33

(25) INFORMATION FOR SEQ ID NO: 25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

AAAACTAGTT TATCATGTCT CCACCAGCAG GGC 33



(26) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

ATCGGTGATG TCGGCGATAT AG 22

(27) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

GTACTGGCGG GCATATTTTT TCCTCCTGA 29

(28) INFORMATION FOR SEQ ID NO: 28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

AATCAGGAGG AAAAAATATG CCCGCCAGTA C 31



(29) INFORMATION FOR SEQ ID NO: 29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

TCGACGATCT GGGTGAGCGG AT 22

(30) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

TCGACGAGCA CCCGGTCGAT CC 22

(31) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

CARTGGGAYG AYGAYGTNCA YCAYGC 26

(32) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2218
(B) TYPE: nucleic acid
(C) strandedness: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Arthrobacter sp.
(B) INDIVIDUAL ISOLATE: S34 (FERM BP-6450)
(ix) FEATURE:
(A) NAME/KEY: mat peptide
(B) LOCATION: 477..2201
(C) IDENTIFICATION METHOD: E
(A) NAME/KEY: 3'UTR
(B) LOCATION: 2202..2218
(C) IDENTIFICATION METHOD: E
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:



CTGCAGCTGC TCGCCCCQGG AACCCCCGAC GTGTACCAGG GCACGGAACG CTGGGACCGG 60
TCGCTGGTGG ACCCGGA^AA CCGTCGCCCC GTGGATTTCG CCGCGGCATC CGAGCTGCTC 120
GACCGCCTCG ACGGCGGCTG GCGGCCGCCC GTCGACGAGA CCGGCGCGGT CAAGACGCTC 180
GTCGTCTCCC GCGCGCTGCG GCTGCGCCGC GACCGGCCCG AGCTGTTCAC CGCGTACCAC 240
CCGGTCACGG CGCGCQGCGC GCAGGCCGAG CACCTGATCG GCTTCGACCG CGGCGGCGCG 300
ATCGCCCTGG CCACCCGCCT GCCGCTCGGC CTCGCCGCCG CAGGCGGCTG GGGCGACACG 360
GTCGTCGACG TCGGQGAGCG GAGCCTGCGC GACGAGCTGA CCGGCCGCGA GGCCCGCGGA 420
GCGGCGCGCG TGGCgiGAGTT GTTCGCCGAC TACCCCGTCG CCCTGCTGGT GGAGAC ATG 479
                                                               Met
                                                               1

AAC CGA CGA TTC CCG GTC TGG GCG CCC CAG GCC GCG CAG GTG ACG CTC 527
Asn Arg Arg Phe Pro Val Trp Ala Pro Gln Ala Ala Gln Val Thr Leu
            5                   10                  15

GTC GTG GGC CAA GGC CGC GCC GAA CTC CCG CTG ACC CGC GAC GAG AAC 575
Val Val Gly Gln Gly Arg Ala Glu Leu Pro Leu Thr Arg Asp Glu Asn
        20                  25                  30

GGA
Gly 

GAC
Asp
50
TCG
Ser 



TGG 
Trp 
35 
TAC
Tyr 

CTG 
Leu 



TGG GCT 
Trp Ala 

GGC TAC
Gly Tyr

CGG CAG
Arg Gln



CTT 
Leu 

CTG 
Leu 



CAG
Gln



CAG
Gln
40
GAC
Asp



CCG TGG GAC GGC
Pro Trp Asp Gly 



CTC GTC
Leu Val 



CCC GCC
Pro Ala 

ACC GGA 
Thr Gly 

GGA AGG 
Gly Thr 
115 
GGC GTC 
Gly Val 
130
CAC GGC 
His Gly 

TAC GGC
Tyr Gly 




CGC GGC
Arg Gly 



GTG ACC 
Val Thr 



GTC GGC 
Val Gly 



TAC TAC
Tyr Tyr 



CAC 
His 
315 
GCC 
Ala 



GAC
Asp 



GCG GTG 
Ala Val 

TTC GGC 
Phe Gly 



GAC GAG 
Asp Glu 



GAG AGC GAC 
Glu Ser Asp 

GGC TAC GGC
Gly Tyr Gly 
305 

CAC GCC AAC 
His Ala Asn 
320 

GGG CTG GGC 
Gly Leu Gly 



623 
671 
719 
767 
815 
863 
911 
959 
1007 
1055 
1103 
1151 
1199 
1247 
1295 
1343 
1391 
1439 
1487
TTC
Phe 

CGG 
Arg 

GTC 
Val 
375 
GAC
Asp 

GCG 
Ala 

GAG 
Glu 

CCC 
Pro 

CGC 
Arg 
455 
AGO 
Thr 

CAC 
His 



CAG 
Gln

CAC 
His 
360 
GCC 
Ala 



CGC 
Arg 
345 
CAC 
His 

TTC 
Phe 



330 

GGC TGG 
Gly Trp 

GGC CGG 
Gly Arg 

GCG CAG 
Ala Gln



CGC ATG 
Arg Met

CTC GTG 
Leu Val 

TGG GGC 
Trp Gly 
425 
GAG CTG 
Glu Leu 
440 
ATG GG 
Met G 




325 

GCC CTC GTC AAG GTG 
Ala Leu Val Lys Val 
340 

TCG AGC TTC CGC GAG 
Ser Ser Phe Arg Glu 
355 

CCG TTC CGC CGG CTC 
Pro Phe Arg Arg Leu 
370 

AAC CGA GCG GTC GGC 
Asn Arg Ala Val Gly 

390 

CTC GCC GCC GCG GCG 
Leu Ala Ala Ala Ala 
405 

CTG TTC ATG GGC GAG 
Leu Phe Met Gly Glu
420

ACC TCC CAC CCC GAG
Thr Ser His Pro Glu 
435 

ATC GCC GAG TTC GCC 
Ile Ala Glu Phe Ala
450 

CCG GAC GAC CCG GCC
Pro Asp Asp Pro Ala 

470 

CCC GAG CGG GAA CCG 
Pro Glu Arg Glu Pro 
485 

ATC GCG CTG CGG CGC 
Ile Ala Leu Arg Arg
500 

GAT GCC GAC GAG GCG 
Asp Ala Asp Glu Ala 
515 

CGG GTC ACG GTC GCG 
Arg Val Thr Val Ala
530 

GGG GGC CTC GTG CTG
Gly Gly Leu Val Leu

50 

CTG CAC CTC GAC hcG 
Leu His Leu Asp Gly

56" 

TGACGCGGCT GGG^'ACC 



1535
1583
1631
1679
1727
1775
1823
1871
1919
1967
2015
2063
2111
2159
2201
2218

(33) INFORMATION FOR SEQ ID NO: 33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

ATGAACCGAC GATTCCCGGT CTGGG 25

(34) INFORMATION FOR SEQ ID NO: 34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

TCACTCGAGG CGCACGATCG CGGCT 25

(35) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

AAATCTAGAT GAACCGACGA TTCCCGGTCT GGGCGC 36

(36) INFORMATION FOR SEQ ID NO: 36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

AAAACTAGTT TATCACTCGA GGCGCACGAT CGCGGC 36

(37) INFORMATION FOR SEQ ID NO: 37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

ATCGTCGGTT CATATTTTTT CCTCCTGA 28

(38) INFORMATION FOR SEQ ID NO: 38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28
(B) TYPE: nucleic acid
(C) strandedness: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

AATCAGGAGG ^AAAATVATATG AACCGACG 

(39) INFORMATION FOR SEQ ID NO: 39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22
(B) TYPE: nucleic acid






